Complexity of Classiication Problems and Comparative Advantages of Combined Classiiers

نویسنده

  • Tin Kam Ho
چکیده

We studied several measures of the complexity of classiica-tion problems and related them to the comparative advantages of two methods for creating multiple classiier systems. Using decision trees as prototypical classiiers and bootstrapping and subspace projection as classiier generation methods, we studied a collection of 437 two-class problems from public databases. We observed strong correlations between classiier accuracies, a measure of class boundary length, and a measure of class manifold thickness. Also, the bootstrapping method appears to be better when subsamples yield more variable boundary measures and the subspace method excels when many features contribute evenly to the discrimination.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pattern Extraction for Time Series Classi

In this paper, we propose some new tools to allow machine learning classiiers to cope with time series data. We rst argue that many time-series classiication problems can be solved by detecting and combining local properties or patterns in time series. Then, a technique is proposed to nd patterns which are useful for classiication. These patterns are combined to build interpretable classiicatio...

متن کامل

A comparative study of multispectral image classifiers: Applications to classification problems with high-dimensional data and high-overlapping spectral signatures. Simulation of . . .

A comparative study of multispectral image classiiers: Applications to classiication problems with high-dimensional data and high-overlapping spectral signatures. Simulation of high-dimensional images. Abstract| Classiication of high-dimensional images is of the almost interest in Remote Sensing applications. Storage space, and specially the computational eeort required for classifying this kin...

متن کامل

Adaptive Selection Of Image Classi

Recently, the concept of \Multiple Classiier Systems" was proposed as a new approach to the development of high performance image classiication systems. Multiple Classiier Systems can be used to improve classiication accuracy by combining the outputs of classiiers making \uncorrelated" errors. Unfortunately, in real image recognition problems, it may be very diicult to design an ensemble of cla...

متن کامل

The Performance of Statistical Pattern Recognition Methods in High Dimensional Settings

We report on an extensive simulation study comparing eight statistical classiication methods, focusing on problems where the number of observations is less than the number of variables. Using a wide range of artiicial and real data, two types of classiiers were contrasted; methods that classify using all variables, and methods that rst reduce the number of dimensions to two or three. The full f...

متن کامل

Design and Evaluation of Neural Classiiers Application to Skin Lesion Classiication

We address design and evaluation of neural classiiers for the problem of skin lesion classiication. By using Gauss Newton optimization for the entropic cost function in conjunction with pruning by Optimal Brain Damage and a new test error estimate, we show that this scheme is capable of optimizing the architecture of neural classiiers. Furthermore, error-reject tradeoo theory indicates, that th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000